A Simple Method to Ensure Plausible Multiple Imputation for Continuous Multivariate Data
نویسندگان
چکیده
Multiple Imputation (MI) is an established approach for handling missing values. We show that MI for continuous data under the multivariate normal assumption is susceptible to generating implausible values. Our proposed remedy, is to 1) transform the observed data into quantiles of the standard normal distribution, 2) obtain a functional relationship between the observed data and it’s corresponding standard normal quantiles, 3) undertake MI using the quantiles produced in step 1 and finally 4) use the functional relationship to transform the imputations into their original domain. In conclusion, our approach safeguards MI from imputing implausible values. URL: http://mc.manuscriptcentral.com/lssp E-mail: [email protected] Communications in Statistics Simulation and Computation
منابع مشابه
Missing data imputation in multivariable time series data
Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...
متن کاملAccuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)
Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...
متن کاملST-14 Handling Missing Data with Multiple Imputation Using PROC MI in SAS
The multiple imputation was developed as a general method for inference with missing data. Instead replacing the missing observation with a single value, multiple imputation method replaces each missing value with multiple plausible values. PROC MI in SAS creates multiply imputed data sets for incomplete multivariate data. This study reviews multiple imputation as an analytic strategy for missi...
متن کاملPractice of Epidemiology Multiple Imputation in a Longitudinal Cohort Study: ACase Study of Sensitivity to Imputation Methods
Multiple imputation has entered mainstream practice for the analysis of incomplete data. We have used it extensively in a large Australian longitudinal cohort study, the Victorian Adolescent Health Cohort Study (1992–2008). Although we have endeavored to follow best practices, there is little published advice on this, and we have not previously examined the extent to which variations in our app...
متن کاملMultiple Imputation for Multivariate Missing-Data Problems: A Data Analyst's Perspective.
Analyses of multivariate data are frequently hampered by missing values. Until recently, the only missing-data methods available to most data analysts have been relatively ad1 hoc practices such as listwise deletion. Recent dramatic advances in theoretical and computational statistics, however, have produced anew generation of flexible procedures with a sound statistical basis. These procedures...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Communications in Statistics - Simulation and Computation
دوره 39 شماره
صفحات -
تاریخ انتشار 2010